Skip to main content

VectorDB

What are vector databases ?

So basically, vector DBs store not semantic variables or index single variables or values but rather embeddings are indexed.

What is Embeddings?

How it works

Embedding step:

You take some data — text, images, audio, even proteins.

Feed it through a model (like OpenAI’s text-embedding-3-small).

The model spits out a vector: a numerical “fingerprint of meaning.”

Example:

“cat” → [0.1, 0.9, -0.3, …]

“dog” → [0.2, 0.95, -0.25, …]

“car” → [0.88, -0.1, 0.45, …]

Notice how “cat” and “dog” are closer than “cat” and “car.”

Storage step:

You put all those vectors into a vector DB (HelixDB, Pinecone, Weaviate, Milvus).

It organizes them in a way that makes similarity search fast, even if you have millions.

Query step:

When you ask a question (say, “small house pet that purrs”), the system embeds your query into a vector.

The DB finds which stored vectors are closest in this high-dimensional space.

Closeness = semantic similarity.